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METHOD AND APPARATUS FOR GRAPHICS PROCESSING 
IN A HANDHELD DEVICE 

FIELD OF THE INVENTION 
[00011 The present invention relates generally to graphics processing and moic 

specifically to graphics rendering in a handheld device. 

BACKGROUND OF THE INVENTION 
[00021 With the growth of modem computing trends, there is an increased demand in 

portability and improved fonctionaUty of a handheld device, v/hcrein a handheld device may be, 
but not limited to, a cellular phone, a personal digital assistant (PDA), a pager, a smart phone, or 
any other suitable portable electronic device capable of providing graphical interactivity, as 
recognized by one having ordinary skiU in the art Furthermore, with the convergence of 
handheld devices having improved ftmctionality and stand alone computing systems, such as a 
desktop or laptop computer, having greater interactivity wilh the handheld device, there is a 
greater demand for improved functionality and quality of interactivily between multiple 
handheld devices and also interactivity between the handheld device and the stand alone 
computing system. 

[00031 An emer^g area in handheld devices is Ihe development of interactive video 

games to be played on the handheld device. With these graphic intensive applications, there 
exist prior art limitations with regard to graphical aichitecture for generating the graphical 
output. One common pit>blem in the handheld device is the Ihnited power and memory 
resources. Current graphics rendering techniques, including three-dimensional graphics 
rendering techniques, require an extensive amount of power to perform the various and multiple 
steps in a graphics-processing pipeline. Furtheimore. thre^dimensional graphics rendering may 
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also be memory intensive due to memory requirements for storing, among other things, vertex 
information, pixel information, and/or texture data. In modem handheld devices, due to size 
requirements and power requirements, there is an exixemely limited amount of available space 
for providing memory resources and power consumption constraints severely restrict graphics 
tendering while maintaining an extended mobile device battery life. 

10004] It would be extremely inefficient to use existing graphics processing techniques in 

modem handheld devices. For example, the handheld device includes a central processing unit 
(CPU) which controls functions for the handheld device, such as hi the example wherein the 
handheld device is a cellular telephone, but the central processor must coordinate the operations 
of the various components to allow for communication and internal operations. Available 
processing cycles within the handheld device CPU severally restricts the graphics processing 
availability and/or the quality of a rendered image. 

[00051 One solution for overcoming CPU computation limitations is the offloading of 

processing requirements to subsequent processors, such as a graphics processor, similarly found 
in devices, such as laptop computers and desktop computers. Although, a current solution is the 
performance of fixed function transformations of the rendering data, such as vertex information, 
by the CPU and triangle setup/rasterization computations by an external graphics hardware 
device, such as a graphics processor. Typically, current handheld device CPUs do not support 
floating point calculations and also have very poor transform performance, which results m the 
handheld device being unable to render an image have a high polygon number, thereby limiting 
the quality of the subsequent output display. 
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[0006] Other limitations found within cunent handheld devices are limited physical real- 

estate for placing graphics rendering engines and also Umited bandwidth avaUabiUty for 
processing the graphics rendering data. As handheld devices become more compact, there exists 
less real-estate for iJie insertion of additional processors for performing the graphics processing 
operations, such as two-dimensional or three-dimensional rendering. Furthermore, the avaUable 
bandwidth for transmitting the graphics rendering data is also limited. Among other things, size 
constraints prohibit a large data bus for graphics rendering information and therefore can 
severely restrict the processing speed and/or quality of a graphics rendering engine. 

[0007] Therefore, a need exists for a method and apparatus that overcomes power 

consumption requirements, limited memory resources, limited graphics transfomaation 
processing within the handheld device and provides for graphics rendering effectively 
augmenting the limited available resources within the mobile device. 

BRIEF DESCRIPTION OF THE DRAWINGS 
(00081 PIG. 1 illustrates a functional block diagram of an apparatus for graphics 

processing in a hamlheld device, in accordance with one embodiment of the present invention; 

[0009] FIG. 2 iUustrates a functional block diagram of an apparatus for graphics 

processing in ahandheld device, in accordance with onb embodiment of the present invention; 

(000101 FIG- 3 Ulustrates a functional block diagram of a handheld device having graphics 
processing, in accordance with one embodiment of the present invention; 

(000111 FIG. 4 illustrates a graphical representation of a bin identifier, in accordance with 
one embodiment of the present invention; 
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[00012] FIG. 5 illustrates a graphical representation of a clip identifier, in accordance with 
one embodiment of the present invention; 

100013] HGS. 6-7 illustrates a graphical representation of an original triangle and 
modified triangle relative to a cUpping region, in accordance with one embodiment of the present 
invention; 

100014] FIG. 8 illustrates a graphical representation of a graphics processor for perfomiing 
the present invention; 

[OOOIS] FIG. 9 iUustrates a flowchart of the steps of a method graphics processing in a 
mobile device, in accordance with one embodiment of the present invention; 

lOOOiei FIG. 10 illustrates a flowchart of the steps of a method for graphics processing in 
mobile device, m accordance with one embodiment of the present invention; 

[00017] FIG. 1 1 illustrates a flowchart of the steps of a method for graphics processing in 
mobile device, in accordance with one embodiment of the present invention; and 



a 
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[000181 FIG- 12 illustrates a flowchart of the steps of a method for graphics proccssmg m 
a mobile device, in accordance with one embodiment of the present invention. 

DETAILED DESCMPTION OF A PREFERRED EMBODIMENT 

[00019] Generally, the present invention includes a method and apparatus ftr graphics 
processing in a handheld device including a transform engine or vertex shader capable of 
receiving vertex information, "me tiansfbrm engine may be a fixed function transform engine 
capable of performing fixed fimction transformations on Ihe vertex information or a 
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progranunablc vertex shadenf.] Regardless thereof, the transform engine thereupon generates a 
plu^aity of vertices from the vertex information, wherein Jke^each of the vertices includes a 
corresponding bin identifier. In one embodiment, the bin identifier is a multi-bit categorization 
of the associated vertex relative to its position on the screen. 

[000201 The method and apparatus ftxlher includes a plurality of view ftamc factors which 
include fectors for defming a clipping region. The clippmg region includes the areas outside of 
the viewable region such that vertices within the dipping region.aie not visible and therefore 
may be clipped to reduce computation overhead. In one embodiment, the vertex shader 
compares each of the plurality of vertices relative to the clipping region and when one of the 
vertices is within the clipping region, a cUp identifier is generated for that vertex using the 
conesponding bm identifier. In accordance with one embodiment, the clip identifier is a multi- 
bit association of the vertices within the cUpping region. 

1000211 The vertex shader is operably cov?>led to the clipping module such that the 
method and apparatus further provides that the vertex shader provides the vertices having 
corresponding clip identifiers to the oUpping module which thereupon generates supplemental 
vertices and the vertex shader may receive the supplemental vertices therefrom. In one 
embodiment, the supplemental vertices ait> newly created vertices within the viewable region and 
outside of the clipping region. The method and apparatus finther includes a vertex buflFer 
coupled to the vertex shader, wherein vertex buffer receives and stores the plurality of vertices 
and the supplemental vertices from the vertex shader. 

[00022] As such, the present invention provides forJlgraphics processing in a handheld 
device by performing vertex transformations in two steps. The first step transforms ^'^^"i^^^^ 
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and outputs the transformed vertex buffer .vith a bin identifier and if the vertex is inside tbe 
clipping region >viites the clip identifier. In the second step the vertices axe projected on to the 
screen using the view frame factors and the triangles are rendered. By breaking the vertex 
transformation into t^vo steps this allows us to perform cUpping on the CPU instead of on the 
graphics processor. Since the percentage of vertices that need clipping represent a very smaU 
nwnber^this allows us to do clipping on the CPU withotrt introducing too much additional 
OverhSd on the CPU and keep the gale count si2« down S^^hics processor which is 
extremely important since it affects power consumption. Another scarce resource is memory 
footprint available during rendering and it is possible that there will be configurations where we 
cannot allocate an entire Z, back buffer. Ibis means that we need to break viewport rendering 
into multiple sub regions and render triangle lists into each of these regions. Since the same 
triangle Ust is being passed as input over multiple sub regions this results in wasted memory 
bandwidth since vertex data would have to be read multiple times (once for each sub-region) 
before deciding whether triangle is displayed in current sub-region and should be rendered or 
not To reduce this memory bandwidth wastage the bin identifier is used. Instead of reading the 
vertex data first, we read the bin identifiers for vertices assodated with a triangle. Hie vertex bin 
identifiers are used to determin^l^gle is displayed in current sub-region and if so then and 
then only the vertex data is read. 

[00023] More specifically, FIG. 1 iUustrates a graphics processing apparatus 100 having a 
vertex ahader 102. The vertex shader 102 may be a fixed fimction transform engine capable of 
performing fixed function transformations on incoming data, such as vertex information 106 or 
can be a programmable vertex shader. In one embodiment, the vertex information 106 U 
transfomied to generate a plurality of vertices, wherein each of the vertices has a corresponding 
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bin identifier 110, which is described in further detail below in FIG. 4. In one embodiment, the 
vertex sbader 102 includes a vertex transfortn engine 103 and a veiwport transform engine 104 
v/hich may be implemented in either hardware, software or a combination thereof for perfonning 
operations as discussed below. 

[00024] The vertex shader 102 receives a plurality of view ftame fectors 108 which 
include any associated factors for determinirg a clipping region defining the viewable area of an 
output display. For example, the view ftame factors 1 08 may include a guard band layer and the 
coordinates of a view frame frustum. As recognized by one having ordinary skill in the art. any 
other suitable view ftame factor may be utilized to define the cUpping region. 

[0002S1 nie vertex shader 102 thereupon determines if each of the vertices is withm the 
viewable region by a comparison of the associated bin identifiers 110 with the boundaries of the 
dipping region. For all vertices that are within the clipping region, a corresponding clip 
identifier generate.^. Tbe clip identifier lll^^rovides an indicati^the vertex bin 

identifier 1 10 and other information for generating supplemental verticesailpThe cHp identifie;^ 
^"nl^^Ssed in Jurther detail below with regards to FIG. 5. 

[00026] Once all of the bin identifiers 1 1 0 have been compared with the cUpping region, 
Ihe cUp identifiers 11^ provided to a clipping module 116. In one embodiment, the clip 
identifiers 1 12a may be provided to a clip buffer 113 for intermediate storage therein, wherein 
the clip identifiers 112b may be fiarther provided to the clipping module 116. In one 
«nbodiment, the clipping module 116 is a module implemented in software by a processor 
performing cUpping operations in response to executable instructions. In one embodiment, the 
cupping module 116 receives the clip identifiers llland generates supplemental vertices |ll4'\| 
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within the viewable regioa and/or outside of the cUpping region, as illustrated in ftirther detail in • 
FIGS, 6-7 below. The supplemental vertices are appended into the vertex buffer. 

[000271 After tlie supplemental vertices have been added to the vertex buffer, the triangle 
list is issued. The vertex shader 102 wUl read vertex bin identifiers 119 to determine if atriangle 
is visible in the current render region and if so the vertices are read, a viewport transformation is 
performed and the triangles are issued to a setup engine (not shown in FIG. 1). 

I00O281 In one embodiment, the vertex buffer 120 actually comprises 3 separate buffers, a 
tiansfonned vertex data buffer, a screen bm buffer and the clip buffer 1 13. In one embodiment, 
vertex identifiers index all of these buffers. The screen bin buffer includes data relating to the 
loc^on of the vertices on the screen plus the guard band region. This data may be used to do a 
fast reject of triangles having multiple vertices when rendering across multiple regions and avoid 
having to read vertex data 106. The cUp buffed 113 describes the vertex indices that refer to 
vertices that are completely outside Hie view frustum for corresponding Z planes and outside the 
guard band regions fbr corresponding X. Y planes. These represent vertices that would have to 
be cUpped by software rumiing on the CPU. These operations are performed during a first phase, 
referred to as a vertex transform phase. 

[000291 In a second phase, a viewport transform phase, software executing a processing 
unit, such as the CPU. not shown, issues a list of liiangle having vertex identifiers. The vertex 
shader 102 also executes a viewport liansfoim code, which in one embodiment is a plurality of 
executable operating instructions for performing specific viewport operations. The viewport 
operations include, but are not limited to. perspective divide, mapping to screen space and back- 
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face ciaiing. Thereupon, the vertex shader 102 may issue a pluralit>' of triangles having visible 
vertices, to a set-ap engine (not shown). 

I00030J FIG. 2 illustrates another embodiment 140 of the present invention including the. 
Hie vertex shader 102 and the clipping module 116. As described above, the vertex shader 102 
receives the vertex data 106, generates the bin identifiers 110 and uses the view frame fectors 
108 to generate the clip identifiers ll^e clipping modiji^nerates the supplemental vertices 
with chp identifiers^l^provides these vertlces\l l^^e vertex shader 102. Thereupon the 
bin identifiers and supplemental vertices 121 generated via the viewport transform engine^OS] to^ 
are provided to a se^ginefi 5^ 

(00031] HG. 2 illustritS^ pixel processhig pipeline used to process Ihe vertex 
information within the vertex buffer 120. In one embodiment, the pfacel processing pipeline 
operates in accordanc&Jajith known pbtel processing techniques using the set-up engine 142, a 
rasterizer engine ^sf^ixel operation module 146, a feme buffer 148 and a display 150. As 
described above, the vertex shader 102, upon executing the viewport transform, issues triangles 
152 to the set-up engine 142. Hie set-up engine 142 generates the primitives 154 which are 
provided to the rasterizer engme 144, wherein the rasterizer engme 144 performs rasterization 
operations thereon. Upon generating a plurality of pixels 156, the pfatel operation module 146 
may thereupon perform any pixel operation, such as scaling, as recognized by one having 
ordinary skill in the art. 

[000321 Thereupon. i^^inSdSruTprovides pixel output 158 to the frame buffer 148, 
which may be any suitable memory device as recognized by one having ordinary skill m the art. 
The frame buffer 148 receives the pixel output 158 and stores the pixel information therein. 

10 

CHICAGOfl/ 10483 153 

PA(£12O4'RCVDAT2/16/20(l51:0S:47PM [Eastern Standardrimel'SVIlUSPra^ 



02/1'6/2005 12:10 FAI 312 609 5005 \'EDDER PRICE KAUFJLAN QOIJ 



I 

Once a frame has been saved in the frame buffer, a frame 160 is provided to the display. As the 
present invention provides for graphics processing in a handheld device, the display 150 may he 
a liquid crystal display, a small flat panel display or any other suitable display as recognized by 
one having ordinary skill in the art being capable of being disposed within a handheld device. 

[00033] FIG. 3 illustrates a block diagram of a handheld device 200 including a baseband 
receiver 202 having an antenna 204. a CPU 206, a memory 208, a subdisplay 210, a graphics 
processor 212. another memory 214, an LCD control 216 and a display 218. In accordance with 
known handheld device operating techniques, the baseband receiver 202, receives and transmits 
communication information via the antemiae 204. The baseband receiver 202 is coupled to the 
CPU 206 to provide communication information 220 thereacross. Furthermore, the CPU is 
coupled to the memory 208 to provide stored information 222 therebetween. 

100034J The CPU 206 is further coupled to the graphics processor 216 across a bus 224, 
which one embodiment may be a flex assembly for providing communication thereacross. In 
one embodiment, the graphics processor 212 may be co;q>led to and mounted on the flex 
assembly 224 for providing for tiie efficient utilization of limited space within a typical handheld 
device. 

[000351 On one embodiment, the memory 208 may be a secured digital random access 
memory (SDRAM) or a DDR memory. It is further recognized that the memory 208 may be any 
suitable memory enable of storing information and executable instructions for the CPU 206 
therein. 

[00036] The CPU 206 and/or the graphics processor 212 may be. but not limited to, a 
single processor, a plurality of processors, a DSP. a microprocessor. ASIC state machine, or any 
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other implementation capahle of processing and executing software. Hie term processor should 
not be construed to refer exclusively to hardware capable of executing software, and may 
impUcitly include DSF hardv,are, ROM for storing software. RAM. and any other volatile and 
non^volatilc storage medium. Moreover, the memory 214 may be, but not limited to, a single 
memory. apluraUty of memory locations, shared memory, CD, DVD, ROM. BAM. EEPROM, 
optical storage, microcode, or any other non-volatilc storage medium capable of storing digital 
data for use by *e graphics processor 212. 

(000371 The graphics processor 212 is coupled to the memory 214 across the bus 226, 
wherein across the bus 226. in one embodiment, the bin identifiers and supplemental vertices 118 
may be provided to the memory 214 acting as the vertex buffer 120 of FIG. 1. Gr^hics 
processor 212 is further coupled to the subdisplay 210 across a bus 228 which may be any 
suitable uni^directional or bi-directional bus providhig for a limited output to a subdisplay 210. 
In accordance with one embodiment of the present invention, the sub-display 210 may be a 
ininiature display. simUar to the display 218, but providing for a limited output, such as a 
viewing screen for F«)viding a ten digit caller I.D. on a handheld device capable or providing for 
telephonic communications. 

(000381 nie graphics processor 212 is foither coupled to the LCD control 216 across bus 
230. which allows for the presentation of display data to be conttolled and eventoaUy provided to 
display 218 across bus 232. Similar to the embodiment described above in FIG. 2, the output 
display 158 provided ftom the ftame buffer 148 to the display 150, wherein the fiame buffer 148 
may be disposed within ihe graphics processor 212 of FIG. 3 and the display 218 includes the 
display 150havingalsotheLCDcontrol2l6forcontiollingoperationthcrein. 
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100039] The system 200 of FIO. 3 iUustratcs a general representation of an apparatus for 
graphics processing in a handheld device Jn accordance with one embodiment to the present 
invention. The CPU 206 performs specific operations and the graphics processor 212 perfoms 
other processing operations. Such that, the present invention improves graphics processing 
within the handheld device by allocating processing requirements among the various processors. 
Breaking the vertex transformations into two separate phases, the vertex transform phase and the 
viewport transform phase, allows the implementation of clipping efficiently on the CPU 206. 
The CPU 206 reads the clip buffer (not shown) disposed within the vertex buffer 120 iUustiated 
in HG. 1, to dctennine if there ai« any vertices to be cUpped. In one embodiment, the cUpped 
buffer may be disposed within the memory 208. If there are vertices to be clipped, using the 
indices, the CPU 206 generates new vertices in the corresponding screen bin identifiers and 
issues the triangle Ust. As such, suice the percentage of vertices to be clipped is relatively small, 
the overhead on the CPU 206 and the bus 224 is thereby limited- 

[000401 FIG. 4 Ulustrates graphical representation of one embodiment of a bin identifier 
250. In one embodiment, the bm identifier 250 includes 8 bits, the least significant bit 252 
representing a vertex in front of the near plane, the second least significant bit. representing a 
vertex outside the fer plane 254 and the sixth most significant bits representing an X, Y position 
256. In one embodiment, a viewing screen may be subdivided into a number of bins wherein the 
X, Y position 256 represents one of sixty-four possible bins in this embodiment Therefore, the 
6 bit flag 256 indicates the conesponding bin upon which the vertex is disposed. 

1000411 As described above, the vertex shader 102 generates the bin identifiers 250 by 
utUizing known transformation techniques, including determining a viewport and a view firame 
area relative to camera and Z axis angles. Furthermore, a cUpping region is determined by any 
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position outside the view frustrum or outside the vieN^-port region and a guard band region. In 
one embodiment, these elements may be determined by view frame factors 1 08. 

[000421 FIG. 5 Ulustrates graphical representation of a cUp ID 260 having a corresponding 
bin ID 250 and a vertex ED 262. As described above, when a^rtex is determined to be 
clippable, snch'^iting Z or X, Y region lestrictions. the clip b^enerated from the 8 bit 
bin ^d the vertex SFmough, as recognized by one having ordinary skill in the art, the bin 
t^STbe any suitable number of bits to provide for further scalabiUty of the present mvention, 
including scalability in the number of grids and aasigmnent of a bin identifier per veitex 
allocation. 

[00043] hi one embodiment, the clip ID 260 is a 32 bit flag having the bin ID 250 as bits 
0-7. me clip ID 260 further includes the vertex ID 262 which is a 24 bit identifier used to hidex 
the vertex data as stored in the vertex buffer. 

(000441 FIGS. 6 and 7 illustrate vertices before and after the clipping process. FIG. 6 
illustrates a triangle 300 having three vertices. 302, 304 and 306. The clipping region 308 is 
defined as regions outside of the viewable area, m contradistinction to the viewable area 310. 
Based on graphics processing described above, an analysis of the vertices 302, 304 and 306 will 
provide for the generation of a clip IDlojfor the vertex 306. Whereas, vertices 302 and 304 are 
wilhm the viewable area 3 1 0. so no clip IDs are generated for these vertices. 

[000451 In accordance with one embodiment of the present invention. ¥10. 7 illustrates the 
generation of supplemental vertices of 312 and 314 having corresponding bin ^0ers. The 
supplement vertices 312 and 314 are generated within the viewable tegionQlO^Sidc of the 
clippmg region 308. TTixough a calculation of the uxtersection of the edge of cUpping region 308 
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be^veen the clip^^nd&e bin I^Sprovides for the lo"=^tion o^pplenie^gg 
314. FvMhennore. a calculation of the location between the clip^B^go^ 
the intersection of the clipping region 308 provides for^ suppleniental ^^g| [:\FIG. 7 
illustrates the 2 newly generated triangles 316 andgl|P^g vertices (302. 304, 312) and (302, 
312, 3 14), respectively, within the display area and outside of the cUpping region 308. 

100046] FIG. 8 illustrates one embodiment of the vertex shader 102 coupled to the CPU 
206 and a cUp counter 330, in accordance with one embodiment of the present invention. In one 
embodiment, the vertex data 106 of FIG. 1, starts in a main memory, such as the memory 208 of 
FIG. 3 and is vmtten to an input buffer. In one embodiment, using matrix multiplication, the 
vertex data 106 may be received as a series of damjiits having the vertex information in a 
sequential order. In one embodiment, tlie CP^^^^tes dippable versions of the vertex 
iirformation to form ihe supplemental vertices which are thereupon added to the list of vertex 
information. In one embodiment, the vertex shader 102 includes a vertex data buffer 332 storing 
vertex data therein which is provided to a plurality of arithmetic logic units 334(a), 334(b), 
334(n). The aritiimetic logic units provide arithmetic operations such that the vertex data is 
transformed to be provided to an output buffer 336. The vertex shader 102 further includes a 
controUer 338 which leceives control instmctions 340 from the vertex data 332 and provides 
control information 342 to the output bii^adordance with known control techniques. 

[00047] In accordance with the one embodiment of the present invention, whenever the 
vertex data is processed by the plurality of arithmetic logic units 334 and a clip identifier is 
generated, the controller 338 increments a value within the cUp counter 330. Ths dip counter 
value 330 is thereupon provided to the CPU 206 such that the number of dipped vertices may be 
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known and thereupon the CPU 206 may know tlie proper number of cUpping operations to be 
performed to generate the supplemental vertices. 

[000481 na 9 Ulustrates the steps of one embodiment of a method for grapHcs 
processing of a handheld device. The method begins, step 400, by receiving plurality of vertex 
information, step 402. Similar to the embodiment described above in HG. I. the vertex 
irformation 106 is provided to the vertex shader 102. Hie next step, step 404, is perfomung a 
vertex transform on the vertex information to generate a plurality of vertices, wherem each of the 
vertices has a corresponding bin identifier. 

[00049] Hiereupon, the method includes comparing each of the vertices with a plurality of 
view frame factors wherein the view frame factors define a cUpping re^on. step 406. As 
discussed above, the view frame factors may include a guard band region, a view fruslrum, or 
any other suitable factors for defining the cUpping region. The next step, step 408 includes 
generating a clip identifier for each of the vertices within the cUpping region using the vertex bin 
identifier. In one embodiment, the step may be performed by the vertex shader 102. Thereupon, 
the next step, step 410 includes providing the clip identifiers to a clipping module. In one 
embodiment, the clipping modnle may be the central processing unit executing clipping module 
software thereon. 

[000501 The next step is receiving a plurality of supplemental vertices generated from the 
plurality of vertices within the clipping region, step 412. As described above, the supplemental 
vertices include newly generated vertices within the viewable region such that new triangles may 
be fomied having vertices all within the viewable region. Thereupon, the method is complete. 
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Step 414. AS recognized by one having ordinary skill in the art, the above method is an 
exemplary embodiment of the present invention. 

1000511 FIG. 10 illustrates another embodiment of a method for graphics processing on a 

handheld device, the method beginning step 420, by receiving a plurality of vertex information. 

performing a vertex transfonn on the vertex information to generate a plurality of vertices. 

wherein each of the vertices has a corresponding bin identifier, step 422. The next step is 
comparing each of the vertices with a pluraHty of view firamc factors, wherein the view ftame 
factors define a cUpping region step 424. Thereupon, step 426, a clip identifier is gene«ted for 
each of the vertices within the cUpping region using the vertex bin identifier. 

1000521 The method fiirther includes inciementhig a cUp counter value, each time a vertex 
is deemed .vithin the clipping region step 428. As described above, with regard to FIG. 8. the 
clip counter 330 is incremented by the controller for every time a clip identifier is generated. 

100053] The next step is providing the clip counter value to a central processing unit, such 
that the central processing unit knows the number of cUp identifiers to be provided to the 
clipping module, step 430. The next step is providing all the cUp identifiers to the clipping 
module and receiving a plurality of supplemental vertices generated fiom the plurality of vertices 
within the clipping region, step 432. Thereupon, the method mcludes performing a viewport 
transform on the pluraUty of vertices and the plurality of supplemental vertices, providing an 
output signal from the viewport transform to a set up engine. 434. As discussed above with 
regards to FIG. 2. The viewport transfonn may be performed within the vertex shader 102 such 
that the setup engine 142 receives the information 152. The next step includes generating a setup 
engine output and providing the set up engine output to a restoration engine, step 436. The next 
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step, step 438, is generating a rasterization engine output signal, providing the rasterization 
engine output signal to a pixel operation module, transforming the rasterization output signal to 
generate a viewable output display signal and providing a viewable output display signal to a 
frame buffer. Thereupon, the next step is providing the ^ewable display signal from the ftame 
buffer to a display device, step 440. Wherein steps 436, 438 and 440 are described ^^mher above 
^th regards to HG. 2. As such, in one embodiment of the present intervention, the method is 
complete, step 442, 

[00054] FIG. 1 1 illustrates the steps of another embodiment of a method for graphics 
processing in a handheld device. Th, method begins, step 460, wherein prior to perfomung the 
viewport transform, step 462 includes defining a pluiaUty of rendering regions and selecting one 
or more of the plurality of rendering regions such ti^at only the vertices wiUdn the rendering 
region (determined by the vertex bin identifiers) aie provided to the viewport transfonner. In 
one embodiment, the display area may be divided into multiple regions wherein each region 
defines a specific area in x and y coordinates of a viewable area. For example, a display may be 
divided into four separate regions having a cenlxal dividing line dovm the middle of the V axis 
and the middle of the axis defining four equally rectangular regions. 
I00055J The next step, step 464, is generating one of a plurality of viewable display 
signals representing a viewable portion of the rendering region, step 464. In this embodiment, 
the same rendering techniques as described above in the steps of FJG. 9 and/or 10 may be 
utilized to generate the viewable display signal, wherein the viewable area has been defined as a 
smaller area, thereby increasing the clipping region to all regions not within the selected 
rendering region, step 468. In the exemplary embodiment where the viewing area is divided into 
four rendering regions, the viewport transform would be performed four times, once for each of 
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the rendering regions. The next step, step 470, is generating the plurality of viewable display 
signals representing the plurality of viewable portion, of the rendering region. Thereupon, step 
472 is voting the fluidity of viewable display signals into the fiame buffer. As such, as steps 
468 through 472 are repeated, for all of the rendering re^ons, contained withm the ftame buffer 
is thereupon a complete viewable diqilay signal. 

(000561 Tlie next step, step 474, is providing a complete viewable display output signal 
from Ihe frame buffer to a display device. In one embodiment, the method further includes 
utili^ins a «z" buffer having a storage capacity of one or more of the plurality of rendering 
regions to generate the plurality of viewable display signals, step 476. Therefore, in this 
embodiment, there is a reduction in the required amount of storage space as a V hnf^ may be 
utilized having only storage capacity up to a single rendering region. Taking this embodiment 
fhrtfaer. in a limited memory situation, the rendering region may be dissected into a larger 
number of viewable portions to accommodate a limited buffer memory space, wherein the 
buffer memory may be dumped once the viewable display signal is written into the frame 
buffer. Thereupon, the method is complete, stqp 478. 

[00057] FIG. 12 Ulustrat«s a further embodiment of the method for graphics processing in 
a handheld device. TTie method begins, 500, by receiving a plurality of vertices. s1«p 502. The 
method further includes transforming the plurality vertices to generate a corresponding bin 
identifier for each of the plurality advertises, step 504. 

[000581 The next step, step 506, is a determination of whether each of the plurality of 
vertices is within a viewable region. When one of the vertices is not within the viewing region, 
thereby within a cUpping region, step 508 is incrementing a cUp counter. Tbe next step, step 

19 

rHir.Ar!O«l0483lS.3 

PA(X 2im < RCVD AT 2lia/20(l5 1:05:47 PM [Eastern Standard Time] ' SVRiUSPTO^ 



02/1*6/2005 12:10 TAX 312 609 5005 __YEDDER PRICE KAUF>L\N @022 



510, is generating a clip identifier using the corresponding bin identifier. Thereupon, the method 
includes clipping the vertices having the clip identified pluiaUty of supplemental vertices having 
a corresponding bin identifier based on each of the vertices having a clip identifier, step 512. 
Thereupon, step 514, includes updating a vertex list to include all of the vertices not within the 
clipping region and the newly generated supplemental vertices. 

(000591 The next step is to add the supplemental vertices to the vertex list, step 516. 
Refening back to step 506. if it is determined that the vertices is within the viewing region, the 
vertices is directly added to the vertex list, step 516. Thereupon, once the vertices are added to 
the vertex Ust. step 5 1 S is performing a viewport transform on the vertex list. As discussed about 
ynm regards to. among other places, FIG. 1, the viewport transform may be perfomied in one 
embodiment by the vertex shadcr 102 of FIG. 1. Thereupon, the method is complete, step 520. 

1000601 Th. above method may fixrther be implemented in another embodiment 
incoiporating the method of FIG. 1 1, wherein a plurality of rendering regions are defined prior to 
perfonning the viewport transfotm. 

1000611 present invention provides for graphics processing in a handheld device 

through the efficient utilization of Ihnited bandwidth and processing requirements. Though 
offloading CPU processing cycles and allowing for the efficient dipping of vertices not within 
the clipping region, but rather^ within the viewable region, a reduction in the amount of 
processing is required. Moreover, through utilizing the associated bin identifiers and clip 
identifiers in conjunction with supplemental vertices, flirthermore in conjunction with the cUp 
counter providing for indication to the CPU of tiie number of clipped vertices to be recalculated 
as supplemental vertices, the gate size of the graphics processor and memory bandwidth required 
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to render a scene can be reduced, ■nierefore. in the limited processing enviroiiment of a typical 
handheld device, the present invention allows for improved graphics rendering, improving 
processing speeds and reducing graphics chip die size, power consumption and memory 
bandwidth requirements, 

[000621 It should be understood that there exists implementations of other variations and 
modifications of the invention and its various aspects, as may be readily apparent to those of 
ordinary skill in the art. and that the invention is not limited by the specific embodiments 
described herein. For example, the generation of cUp identifiers based on a defined cUpping 

region may be performed in an "on the fly" calculation, providing for a further reduction in 

n^emory storage requirements. It is therefore contemplated to cover by the present invention. 

any and all modifications, variations, or equivalents to fell vvithin the spirit and scope of the basic 

underlying principles disclosed and claimed herein. 



21 



PAGE 23/34 'RCVD AT 211612005 1:05:47 PM [EastemStandard 



02/16/2005 12:13 FAX 312 609 5005 VEDDER PRICE K.\UF5UN 12|024 



CLAIMS 



What is clabncd is: 

1. apparatus for graphics processing in a handheld device, the apparatus 

comprising: 

at least one of: a Worm engine and a vertex shader. such Ihat the at least one of the 
ttansfonn engine and the vertex shader is capable of receiving a pluiaHty of 
vertex information and generatmg a plurality of vertices, wherein each of the 
vertices has a corresponding bin identifier, 
a pluraUty of view frame factors defining a clipping region, upon which the vertex shader 
compares each of the plurality of vertices, wherein when one of the vertices is 
within the clipping region, a clip identifier is generated for that vertex using the 
corresponding bin identifier, 
the vertex shader operably coupleable to a cUpping module on a central processing unit 
(CPU) wherein when one of the vertices is within the clipping region, the vertex 
shader provides the vertices having a corresponding dip identifier to the clipping 
n^odulc which thereupon generates supplemental vertices and the vertex shader 
receives the supplemental vertices therefrom; and 
a vertex buffer operably coupled t» the vertex shader wherein the vertex buffer receives 
ihe plurality of vertices and the supplemental vertices from the dipping module 
on the CPU end stores the plurality of vertices therein, 
2. The apparatus of daim 1 further comprising: 

a seti^ engme operably coupled to the vertex shader wherein the setup engine tec^ves 
the plurality of vertices and the supplemental vertices. 
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3. The appiiratus of claim 2 fiirflier comprising: 

a rasterization engine coupled to the setup engine for receiving a setup engine output; and 
a pLxel operation module coupled to the rasterization engine and the frame buffer such 
that the pixel operation module transforms a rasterization engine output signal and 
provides a viewable output display signal to the frame buffer. 
4. The apparatus ofdaim 3 further comprising: 

a display device operably coupled to the frame buffer such that upon generation of a 
viewable display from the viewable output display signal, the viewable display 
may be provided to the display device from the frame buffer. 
The ^atus of claim 4 wherein the display device is a liquid crystal display. 
The apparatus of claim 1 therein the plurality of view frame factors include at 
least one of the following: a view frustum region and a guard band region. 
7. The apparatus of claim 1 further comprising: 

a cUp counter having a clip counter value such that the clip counter value is incremented 
each time one of the vertices is deemed within the clipping region and once all of 
the plurality of vertices have been generated by the vertex shader, the clip counter 
value is provided to a central processor such that the central processor may 
properly retrieve the corresponding number of cUp identifiers. 
8. Ihe apparatus of claim 1 wherein the clipping module is disposed on a central 
processor and the apparatus U coupled Xo the central processor across a bu, and the transform 
engine is disposed within the vertex shader. 



5. 

6. 
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9. A method for graphics processing in ahandhdd device, the method comprising: 
recei^^Ilg a pluraUty of vertex information; 

performing a vertex tra^oxm on the vertex information to generate a plurality of 
vertices, v^herein each of the vertices has a corresponding bin identifier and a 
coiresponding vertex identifier; 
ccnparing eaoh of a« vertices «U. a plmditj, of vi<»w fi«nc fe«ots «hc«m the view 

frame factors define a clipping region; 
generating a clip identifier for each of the vertices within the cUpping region using the 

coiresponding bin identifier and the corresponding vertex identifier; 
providing the cUp identifiers to a clipping module; and 

receiving a pithily of suppl^ental vertices generated from the pluraUty of vertices 

vathin the clipping region. 
10. The method ofclaim 9 further comprising: 

incrementing a cUp counter value each time one of the vertices is deemed within the 
olippitig region; and 

providing the cUp counter value to a central processor such lhat the central processing 
knows the number of cUp identifiers to be provided to the clipping module. 

11. The method of claim 9 further comprising: 

perfomJ.^ a vi=«pon "a»ft™ ■>» P'»^ °' 
supplemental vertices. 

12. The method of claim 1 1 further comprising: 

providing an output signal ftom Ihe viewport transform to a sel^p engine. 

13. The method of claim 12 fiirther comprising: 
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g^erating a setup engine output; and 
providing the setup engine output to a rasterization engine. 
14. The method of claim 1 3 further comprising: 
generating a rasterization engine output signal; 

pro^dding the rasterization engine output signal to a pixel operation module; 
transforming tlie rasterization engine oulput signal to generate a viewable output display 
signal; and 

providing the viewable output display signal to a frame buffer. 

15. The method of claim 14 fiii4.er comprising: 

providing the viewable display signal from the frame buffer to a display device. 

16. TTie method of claim 11 wherem prior to the step of performing the viewport 

transform, the method ftirtber comprises: 

receiving a list of triangles to be rendered, wherein the triangle is defined by three 

vertices; 

for each of the triangles in the Ust of triangles, detemaining if each of the vertices 

defining the triangle is within the clipping region; and 
discarding the vertices within the clipping region thereby generating a Ust of only 

triangles having vertices within the clipping region. 
1 7. The method of claim 16 fiirther comprising: 

prior to perfonning the viewport transform, defining a plurality of rendering regions and 
selecting one of the phiralily of rendering re^ons such that only the vertices 
within the rendering regions are provided to the viewport transfoimer; 
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generating one of a plurality of vie^vabk display signals representing a viewable portion 

of the rendering region; and 
^vIiting me vie^vable display signal to a frame bufifer. wherein the viewable display signal 

is directed to the selected tendering region. 
18. The method ofclaim 17 further comprising: 

performing the viewport transform for each of the plurality of rendering regions; 

generating the plurality of viewable display signals representing the plurality of viewable 
portions of the tendering region; 

writing the pluraUty of viewable display signals into the frame buffer, and 

providing a complete viewable output display signal from the frame buffer to a display 
device, wherein the complete viewable display output signal includes the pluraUty 
of viewable display signals, thereby capable of generating a complete output 
display. 

19. The method of claim 18 ftnlher comprising: 

otilizing a 2 buffer having a storage capacity of one of the plurality of rendering regions 
to generate the plurality of viewable display agnals. 
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20. A method for graphics processing in a handheld device comprising: 
receiving a plurality of vertices; 

tranaforming the plurality of vertices to geaetate a coirespondiBg bin identifier and a 

vertex identifier for each of the plurality of vertices; 
for each vertex, detennine if the vertex is within a clipping region; 
if the vertex is within a clipping region: 
incrementing a clip counter; and 

generatmg a cUp identifier nsing the corresponding bin identifier and the vertex 
identifier; 

clipping the vertices having the clip identifier to generate a plurality of supplemental 
vertices having a corresponding bin identifier based on each of the vertices having 
a clip identifier; 

updating a vertex list to include all vertices not within the clipping region and the 

supplemental vertices; and 
performing a viewport transformation on the vertex Ust 
21. The method ofclaim 20 further comprising: 
dividing a fuU frame into a plurality of frame portions; 

defining tiie dipping region to include all vertices outside one of the pluxaUty of frame 
portions such that the vertex list only includes vertices witinn the one of plurality 
of the frame portions; 

after performing the viewport transformation on tixe vertex list, generating a viewable 

output display signal representing the frame portion; and 
writing tiie viewable output display signal in a frame bu^- 
22. The metiiod of clahn 21 fiirther comprising: 
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for each of the plurality of frame portions: 

definiDg the clipping region to include all vertices outside one of the plurality of 
f,^e portions such that the vertex list only includes vertices within the 
one of the plurality of frame portions; 
after performing the vievvport transformation on the vertex list, generating a 

viewable output display signal representing the frame portion; and 
VTiting the viewable output display signal in a frame buffer; and 
providing the plurality of viewable output display signal from the frame buffer to an 

output display. 

23. The method ofclaim 22 fiirther comprising: 

during the generation of viewable output display signal, utilizing a z buffer having a 
storage capacity of one of the pluraliiy of rendering regions to generate the 
plurality of viewable display signals. 

24. The method of claim 20 wherein the step dipping the vertices is perfoimed by a 

central processing unit. 

25. The method of claim 20 wherein the step of transforming the pluraUty of vertices 

and the step of viewport transformation is perfonned on a gr£«.hics processor. 
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26. A graphics processing device for a handheld device comprising: 

a vertex shader capable of receiving a plurality of vertex information and gener^ a 

plurality of vertices, ^.herein each of the vertices has a corresponding bin 

identifier; 

a central processing mrit coupled to a vertex shader; 

a plurality of view fr^c fectors defining a clipping region. t«,on which the vertex shader 
compares each of the plurality of vertices, wherein when one of the vertices is 
• ^thm the clipping region, a cUp Identifier is generated for that vertex using the 
corresponding bin identifier; 
a cUp counter having a cUp counter value such that the clip counter value is incremented 
each time a vertex is deemed within the clipping region and once all of the 
pluraUty of vertices have been generated by the vertex shader, the cUp counter 
value is provided the cental processor such that the central processor may 
properly retrieve the corresponding number of cUp identifiers; 
the vertex shader provides the vc^ces having a corresponding clip identifier to the 
central processor executing a cUpping module such that the cUpping module 
generates a plurality of supplemental vertices and the vertex shader receives the 
supplemental vertices firom the central processor; and 
a vertex buffer opena,ly coupled to the vertex shader wherein the vertex buffer receives 
the plurality of vertices and the supplemental vertices from the vertex shader and 

stores the pluraUty of vertices therein. 
27. nxegrt^hics processing device of claim 26 farther comprising: 
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a setup engine operably coupled to the vertex shader wherein the setup engine receives 

the pluralit>' of vertices and the supplement vertices; 
a rasterization engine coupled to the setup engine for receiving a setup engine output; and 
a pixel operation module coupled to the rasterization engine and the ftame buffer soch 
that the pixel operation module Hansforms a rasterization engine output signal and 
provides a viewable output display signal to the frame buffer. 

28. Tbe graphics processing device of claim 27 fijithcr comprising: 

a display operably coupled to ihe frame buffer such that upon generation of a viewable 
display from the viewable output display signal, tiie viewable display may be 
provided to the display from the frame buffer. 

29. The graphics processing device of claim 26 wherein the plurality of view fiame 
factors include at least one of the following: a depth factor, a view frustum region and a gu^ 
band region. 

30. The graphics processing device of clahn 26, whereinihe vertex shader includes a 
transform engine such that the transform engme generates the plurafity of vertic«^ 
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METHOD AiND APPARATUS FOR GRAPfflCS PROCESSING 
Mtinuv ji,- A HANDHELD DEVICE 

ABSTRACT 

The pi«em ta.«tfi<m iDCludes . metod and appmtus fcr graphics processtag to a 
ta«Jl»ld device including a taansform engine capable of recei™.g vertex informafion. The 
^nafonnenginegenerateaaplurality of vertices fron.d>ever»xiofoxn^cn.«h=reineachof 

a. vertices include, corresponding Wn identifier. The method a.rf.pp.ran.O.rltoi-'-J" 
view fianrefacK-rsdermingacUpping region such that when any of the phrralitj-otverSces is 

within fl»cUppingre^on..cBpid««i8e. is g««ra.<^fbr that vertexusingthe correspond 
Wnidentifier. Avert«sh«le,eo,.pledto.clipi^modnl^wh«eina«cW.«nrodule 
S^terates supplemental vertices andthe vertex shadcr receives the suppl«nen«a vertices 
thereftom. T*evert«,8had«comhines the supple-nental vertices with the bin idertifierssnd are 

provided to a vertex buffer. 
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